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Abstract 

Using silicon-based recording electrodes, we recorded neuronal activity of the 
dorsal hippocannpus and dorsonnedial entorhinal cortex fronn behaving rats. 
The entorhinal neurons were classified as principal neurons and interneurons 
based on nnonosynaptic interactions and wave-shapes. The hippocannpal 
neurons were classified as principal neurons and interneurons based on 
nnonosynaptic interactions, wave-shapes and burstiness. The data set contains 
recordings fronn 7,736 neurons (6,100 classified as principal neurons, 1,132 as 
interneurons, and 504 cells that did not clearly fit into either category) obtained 
during 442 recording sessions fronn 1 1 rats (a total of 204.5 hours) while they 
were engaged in one of eight different behaviours/tasks. Both original and 
processed data (tinne stannp of spikes, spike wavefornns, result of spike sorting 
and local field potential) are included, along with nnetadata of behavioural 
nnarkers. Connnnunity-driven data sharing nnay offer cross-validation of findings, 
refinennent of interpretations and facilitate discoveries. 
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Introduction 

The hippocampus and entorhinal cortex are essential structures 
for memory and spatial navigation ^ l Position-tuned cells ('place 
cells') are present in CAl, CAS and dentate gyrus regions^ *^. Grid 
cells, head direction cells, and border cells have been described in 
the dorsomedial entorhinal cortex, and are critical ingredients of 
navigation systems^'^'^'^^"^^ The temporal coordination across the 
entorhinal cortex and hippocampus is secured by various oscilla- 
tions, especially theta, gamma and sharp wave ripples^^ -^ 

We recorded activity of neurons in these brain regions while ani- 
mals performed various tasks, such as linear track, open maze, 
T-maze with wheel running delay, plus maze and zigzag maze, as 
well as recordings during sleep in the home cage. Extensive tech- 
nical descriptions of the data sets described in this document are 
available in several published papers^''^^"^^. 

Several questions related to memory, navigation, spike time pat- 
terns, population coding, neuronal interactions, neuronal classifica- 
tion, replay, sleep homeostasis and oscillations have been studied 
based on this dataset^''-^"^^ However, this dataset may provide valua- 
ble information if subjected to yet further analyses. Improved spike 
sorting, neuron classification and more sophisticated analyses may 
extend and refine the initial conclusions and offer insights that were 
previously missed. For these reasons we provide both unprocessed 
(wide band) and processed versions of our data. In our experience, 
all methods have limitations and must undergo continuous revision. 
We believe that community-driven data sharing, cross-validation of 
data, unified data formats and large collaborative efforts will facili- 
tate discovery and benefit future progress in neuroscience. 

Material and methods 

Animal surgery 

All protocols were approved by the Institutional Animal Care and 
Use Committee of Rutgers University (protocol No. 90-042), and 
all experiments were performed at Rutgers University. Before sur- 
gery, one to four rats were housed in a single home cage (made of 
plastic; size L = 45 cm, W = 23.5 cm, H = 20 cm). Wood shavings 
were used as bedding and dry pellets were provided as food. The 
animals were housed in a temperature controlled (68 °F), but not a 
specific pathogen free, environment under 12:12-hours light:dark 
cycle where light cycle was from 7AM to 7PM. After surgery, the 
rats were housed individually, and highly absorbent paper (Tech- 
board, Shepherd Speciality Papers) was used as bedding, and the 
animal's health was assessed daily by the experimenters. 

Details of surgery and recovery procedures have been previously 
described in detail^-'^\ Eleven Long Evans rats (male, 3-8 months 
old, 250-400 g) were deeply anesthetized with isoflurane (1-1.5%). 
In two rats (f01_m and g01_m), two silicon probes were implanted 
(one in each hemisphere) and targeted CAl region. In three rats 
(gorOl, pinOl and vvpOl), two probes (32- and/or 64-site silicon 
probes) were implanted in the left dorsal hippocampus, targeted 
to CAl and CA3 separately, and advanced over sessions and days 
through overlying neocortical and hippocampal tissue. The probe 
positions were: rat pinOl: CA3: at a 35 degree angle to coronal 
plane, centered on 2.8 mm posterior and 2.6 mm lateral to bregma. 



CAl: 26.5 degree angle to vertical, at a 35 degree angle to coro- 
nal, centered on 4.6 mm posterior and 2.4 mm lateral to bregma; 
rat vvpOl: CA3: at a 26.5 degree angle to coronal plane, centered 
on 2.8 mm posterior and 2.6 mm lateral to bregma. CAl: 26.5 
degree angle to vertical, parallel to sagittal plane, centered on 4.4 
mm posterior and 2.3 mm lateral to bregma; rat gorOl: CA3: at a 
26.5 degree angle to coronal plane, centered on 3.1 mm posterior, 
and 3.0 mm lateral to bregma. CAl: 26.5 degree angle to vertical, 
at a 45 degree angle to coronal plane, centered on 4.9 mm pos- 
terior and 1.5 mm lateral to bregma. In four rats (ec013, ec014, 
ec016 and i01_m), 32- or 64-site silicon probe(s) were implanted 
in the right dorsal hippocampus and recorded from CAl, CA3 or 
dentate gyrus, and another 4-shank silicon probe was implanted in 
the right dorsocaudal medial entorhinal cortex. In one rat (ec012), 
one 4-shank silicon probe was implanted in the right dorsocaudal 
medial entorhinal cortex. In rat ec012, ec013, ec014, and ec016, the 
probe targeting the entorhinal cortex was positioned such that the 
different shanks recorded from different layers-^ (4.5 mm lateral 
from the midline; 0.1 mm anterior to the edge of the transverse 
sinus at a 20-25 degree angle in the sagittal plane with the tip point- 
ing toward the anterior direction). In rat i01_m, the EC probe had 
4 shanks and was positioned such that all shanks recorded from 
the same layer. For the hippocampus probe in rats ec013, ec014 
and ec016, the shanks were aligned parallel to the septo-temporal 
axis of the hippocampus (45 degrees parasagittal), positioned cen- 
trally at 3.5 mm posterior from bregma and 2.5 mm lateral from 
the midline. 

For all silicon probes used, each shank had eight recording sites 
(160 |am^ each site, 1-3-MQ impedance), and intershank distance was 
200 |am. Recordings sites were staggered to provide a two-dimensional 
arrangement (20 jam vertical separation)^^'^\ The individual silicon 
probes were attached to respective microdrives and moved inde- 
pendently and slowly to the target. Two stainless steel screws 
inserted above the cerebellum were used as indifferent (reference) 
and ground electrodes during recordings. At the end of the physi- 
ological recordings during the behavioural tasks, a small anodal DC 
current (2-5 |iA, 10 s) was applied to recording sites 1 or 2 days 
before rats were deeply anesthetized and euthanized by perfusion 
with 10% formalin solution. The positions of the electrodes were 
confirmed histologically and reported previously in detaiP^'^^. 

Behavioural testing 

After the animals recovered from surgery (1 to 2 weeks), physio- 
logical signals were recorded during eight different types of behav- 
iours mostly during light cycles (see Table 1). 

(1) On an elevated linear track (250 cm x 7 cm), the animals 
were required to run back and forth to obtain water reward 
at both ends-^ In three animals (gorOl, pinOl, and vvpOl), 
a similar elevated track was used (170 cm x 6.2 cm, with 
22 cm X 22 cm end platforms) that was shortened to 79 or 
125 cm in some trials- 

(2) In the open field task, the rats chased randomly dispersed 
drops of water or pieces of Froot Loops (25 mg; Kellogg's) 
on an elevated open platform-^ (180 cm x 180 cm, 120 cm 
X 120 cm or 100 cm x 200 cm). 
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Table 1. Behaviour descriptions. 



Behaviour 


Denaviour suDciass 

(Behaviour 

identifier) 


Description 


elevated linear track 


linear 


Linear track, 250 cm x 7 cm. H 


elevated linear track 


linearOne 


Linear track (170 cm x 6.2 cm, with 22 cm x 22 cm end platforms) that 
was shortened to 79 or 125 cm in some trials^^'^^ (Usually shortened but 
sometimes also lengthened). The same linear track was used in linearOne 
and linearTwo but at different locations in the same recording room. The 
center of the track was at the same position for linearOne and linearTwo, but 
the track was at fixed 36.9 degree angles from each other, corresponding to 
the diagonals of the 480 x 640 pixel camera. 


elevated linear track 


linearTwo 


Exactly the same as linearOne but the linear track was at different locations 
in the same recording room. See linearOne. 


open field 


bigSquare 


180 cm X 180 cm. 


open field 


bigSquarePlus 


180 cm X 180 cm square open field, divided by plus shaped walls put in the 

ppntpr nf thp fipld 

Ov^l ILOI ^1 LI \\^ IIOIV^. 


nnpn field 


midRni jarp 

1 1 1 1 U Cll \^ 


1 20 cm X 1 20 cm. 


nnpn fiplrl 


Open 


1 no cm X 200 cm 

1 \J\J Ol 1 1 /\ L—\J\J Ol 1 1 


rewarded wheel- 

ri inninn tflcik 

1 ui II III ly LCior\ 


wheel 


Operant wheel running task. See Mizuseki etal., 2009^^ 


alternation task in 

ClILv^l 1 IClLIV_yi 1 LCIOIX II 1 

T-maze 


Mwheel 


Alternation task in T-maze (100 cm x 120 cm) with wheel running delay. See 
Pastalkova etal., 2008^ 


alternation task in 
T-maze 


Tmaze 


Alternation task in T-maze, the same as Mwheel but without delay period. 
There were 2.78 camera pixels/cm, which converts to 22.24 units/cm for the 
.whi files (8x compression of pixels). 


elevated plus maze 


plus 


Plus maze. 100 cm x 100 cm. 


zigzag maze 


Zigzag 


100 cm X 200 cm zigzag maze. See Royer etal., 2010^^. 


wheel-running in 
home cage 


wheeLhome 


Wheel running in home cage with free access to a wheel with no 
reinforcement. 


sleep 


sleep 


Sleeping in home cage. 



(3) In the rewarded wheel-running task, a wheel (diameter = 29 
cm) was attached to a rectangular- shape box (39 cm x 39 cm 
X 39 cm). The rat was required to run in the wheel continu- 
ously for 10 seconds, after which time a piece of Froot Loop 
was dropped in the box as reinforcement- ^ 

(4) In the alternation task in the T-maze (100 cm x 120 cm) 
with wheel running delay, the animal was required to run on 
a wheel attached to the waiting area for 10 sec, after which 
time the animal had access to the central arm of the T-maze, 
at the end of which the animal chose to turn right or left. The 
animal was rewarded with water if the choice was opposite 
to the previous one^. 

(5) In the elevated plus maze (100 cm x 100 cm), the rats were 
motivated to run to the ends of four corridors, where water 
was given every 30 s. 

(6) In the zigzag maze (100 cm x 200 cm) with 11 corridors, 
the animals learned to run back and forth between two water 
wells; 100 jil of water was delivered at each welP^ --'^^'^^. 

(7) In the wheel-running in home cage, a wheel (diameter = 29 
cm) was attached to a rectangular- shape box (39 cm x 39 cm 
X 39 cm) which was used as a home cage during the experi- 
ment. Rats had free access to the wheel, and ran on the wheel 
with no reinforcement. 



(8) In the sleeping session, the rat slept in the home cage. 

For recording of behaviour (1) to (6), animals were water- scheduled 
for 23 hours prior to the experiment. Otherwise, both dry food and 
water were provided ad libitum. For tracking the position of the 
animals, two small light-emitting diodes, mounted above the head- 
stage, were recorded by a digital video camera (SONY) at 30 Hz 
resolution. 

Data collection and cell-type classification 

Detailed information about the recording system and spike sorting 
has been previously de scribed- Briefly, signals were amplified 
(l,000x), bandpass-filtered (1 Hz-5 kHz) and acquired continu- 
ously at 20 kHz (DataMax system; RC Electronics) or 32,552 Hz 
(NeuraLynx, MT) at 16-bit resolution. After recording, the signals 
were down-sampled to 1,250 Hz (DataMax system) or 1,252 Hz 
(NeuraLynx system) for the local field potential (LFP) analysis. 
In electrophysiological recordings, positive polarity is from zero 
toward positive values. To maximize the detection of very slowly 
discharging ('silent') neurons^^ clustering was performed on con- 
catenated files of several behavioural and sleep sessions recorded 
at the same electrode position on the same recording day^^'^^"^^. We 
made extensive use of publicly available analytical and display pro- 
grams, which were developed in our laboratory (KlustaKwik^^ avail- 
able at http://sourceforge.net/projects/klustakwik/, Neuroscope^'^ 
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available at http://neuroscope.sourceforge.net/, Klusters"^"^ avail- 
able at http://klusters.sourceforge.net/, NDmanager'^'^ available at 
http://ndmanager.sourceforge.net/). The latest available version at 
the time was used in each case. Spike sorting was performed auto- 
matically, using KlustaKwik^^ followed by manual adjustment of 
the clusters, with the help of autocorrelogram, cross-correlogram and 
spike wave- shape similarity matrix (Klusters software package^'^). 
Because none of the existing spike sorting algorithms is completely 
automated, manual adjustment is necessary^l This inevitably leads 
to some operator-dependent variability^^; therefore, provided clus- 
ters are not always identical to those used in our previous publica- 
tions. Hippocampal principal cells and interneurons were separated 
based on their burstiness, waveforms and short-term monosynaptic 
interactions^' Classification of principal neurons and interneu- 
rons of entorhinal cortical neurons was based on waveforms and 
short-term monosynaptic interactions, and described previously in 
detaiP^ A total of 3,113 (CAl), 882 (CA3), 66 (DG), 491 (EC2), 568 



(EC3) and 551 (EC5) principal neurons and 420 (CAl), 198 (CA3), 
52 (DG), 85 (EC2), 215 (EC3) and 91 (EC5) interneurons were 
identified and included in this data set (see Table 2-Table 4). 

The tip of the probe either moved spontaneously relative to the 
brain or was moved by the experimenter between recording days 
to record from potentially different sets of neurons. However, we 
cannot exclude the possibility that some neurons recorded on dif- 
ferent days were identical, because spikes recorded on each day 
were clustered separately, though in some instances neurons were 
recorded over multiple days. When we moved the electrodes, we 
waited for at least an hour before recording in order to stabilize the 
position of electrodes. 

Data description 

The data are available^^ at CRCNS.org (http://dx.doi.org/10.6080/ 
K09G5JRZ). Details of the data collection, processing and storage 



Table 2. Number of cells recorded. Top row: animal identifier. Left column: brain region. Brain region EC4 indicates 
either entorhinal cortex layer 3 or 5 (could not be determined which); region EC? indicates in entorhinal cortex, but 
without layer assignment. 



Brain 
region 

EC2 

ECS 

EC4 

ECS 

EC? 

Total EC 



ec012 ec013 ec014 ec016 f01_m g01_m gor01 i01_m j01_m pin01 vvp01 total 



201 



110 



311 



311 
362 
276 
416 



180 
177 

68 



1365 425 



112 
116 
57 
154 

439 



82 
82 



603 
856 
333 
748 
82 

2622 



CAl 
CA3 
DG 

Unknown 
Total 



311 



1185 
223 
41 
39 

2853 



1136 661 99 145 
646 
94 

1561 1840 99 145 



50 
153 



203 



309 



3 

394 



90 
90 



23 
45 



68 



116 
56 



172 



3724 

1123 

135 

132 

7736 



Table 3. Number of principal cells. Top row: animal identifier. Left column: brain region. 



Brain 
region 


ec012 


ec013 


ec014 


ec016 


f01_m 


g01_m 


gor01 


i01_m 


j01_m 


pin01 


vvp01 


total 


EC2 




248 


146 


97 
















491 


EC3 


140 


239 


101 


88 
















568 


EC4 




214 




46 
















260 


EC5 


89 


300 


34 


128 
















551 


EC? 
















51 








51 


Total EC 


229 


1001 


281 


359 








51 








1921 




CM 




887 


995 


577 


79 


131 


42 


289 




19 


94 


3113 


CA3 




217 




443 






138 






41 


43 


882 


DG 




18 




48 
















66 


Unknown 




37 












1 


80 






118 


Total 


229 


2160 


1276 


1427 


79 


131 


180 


341 


80 


60 


137 


6100 
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Table 4. Number of interneurons. Top row: animal identifier. Left column: brain region. 



Brain 
region 


ec012 


ec013 


ec014 


ec016 


f01_m 


g01_m 


gor01 


i01_m 


j01_m 


pin01 


vvp01 


total 


EC2 




45 


27 


13 
















85 


ECS 


S7 


89 


66 


23 
















215 


EC4 




31 




8 
















39 


ECS 


16 


36 


20 


19 
















91 


EC? 
















24 








24 


Total EC 


53 


201 


113 


63 








24 








454 




CA1 




205 


90 


46 


19 


13 


8 


14 




3 


22 


420 


CAS 




4 




174 






14 






2 


4 


198 


DG 




16 




36 
















52 


Unknown 




1 












1 


6 






8 


Total 


53 


427 


203 


319 


19 


13 


22 


39 


6 


5 


26 


1132 



Table 5. Number of recording sessions. Top row: animal identifier. Left column: behaviour subclass. 



Behaviour 
subclass 


ec012 


ec013 


ec014 


ec016 f01_m 


g01_m 


gor01 


101 _m j01_m 


pin01 


vvp01 


total 


bigSquare 


24 


45 


4 


13 






1 4 






91 


bigSquarePlus 




2 
















2 


linear 


18 


90 


2 


9 












119 


linearOne 












3 






5 


8 


linearTwo 












3 






5 


8 


midSquare 




4 


8 


2 












14 


Mwheel 


28 


16 


8 


14 8 


7 




8 






89 


Open 


















3 


3 


plus 




11 
















11 


sleep 






19 


10 










1 


30 


Tmaze 












2 




3 


1 


6 


wheel 




40 


8 


9 




1 








58 


wheeLhome 








2 












2 


ZigZag 






1 














1 



Total 70 208 50 59 8 7 9 9 4 3 15 442 



of data into files are included with the data set, including scripts 
useful for processing the data^^\ Here, we briefly summarize the 
data description. 

The number of cells recorded from each animal and brain region is 
shown in Table 2. 

Most of the recorded cells were classified as principal neurons 
or interneurons. The number of cells classified as principal and 
interneuron are shown in Table 3 and Table 4. 

The 8 types of behaviours (see Behavioural Testing section) were 
further subdivided into 14 behaviour subclasses based on minor 
differences (e.g. size of maze) and used as behaviour identifiers in 
the dataset (Table 1). 



The data were obtained during 442 recording sessions. During each 
session the animal performed one of the 14 behaviour subclasses. 
The number of recording sessions and behaviour subclasses used 
with each animal is shown in Table 5. The description of each 
behaviour subclass is given in Table 1. 

Data file organization 

The data files for each recording session are stored in separate com- 
pressed tar archive files (i.e. with extension "tar.gz"). These files 
are organized into top-level directories, each of which contains data 
for sessions recorded on the same day using the same animal and 
electrode placement combination. Data from all sessions recorded 
from the same animal on the same day were merged for spike sort- 
ing. All merged sessions are stored in the same top-level directory 
in the data set at CRCNS.org. Therefore, the cell identification 
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numbers assigned by the spike sorting are common to all sessions 
within a top-level directory, and are not specific to individual ses- 
sions. Details of the file organization are provided in the document 
"CRCNS.org hc3 data description" which is included with the 
data set. 

Metadata organization 

The metadata describing the data is stored in four tables that are 
included with the data set. Table cell has information about each 
spike sorted cell. Table session has information about each experi- 
mental session. Table epos contains information about the position 
of the electrodes. And table file has information about the ".tar.gz" 
and other files that are in the data set. 



These tables are provided in CSV (comma- separated values) 
format, Excel format, and as tables in an SQLite database. SQLite 
(http://www.sqlite.org/) is a free, open source, SQL data base 
engine available for all common operating systems. These tables 
are related to each other through a field (named "topdir"), which 
has the name of top-level directories described above and is com- 
mon to all four tables. The fields in each of these tables are listed in 
Listing 1. As described in file "CRCNS.org hc3 data description" 
the SQLite command interface can be used with these tables to gen- 
erate summary statistics from the metadata and to locate data files 
that satisfy particular search criteria (for example, find data for cells 
of a specific type from a particular brain region and experimental 
behaviour). 



Listing 1: Create table statements for tables: cell, session, file and epos. Fields for each of these tables are documented in the comments. 



create table cell 
id integer, 
topdir string, 
animal string, 
ele integer, 
clu integer, 
region string, 
nexciting integer, 
ninhibiting integer 
exciting integer, 
inhibiting integer. 



Id used to match original row number in MatLab PyrlntMap . Map matrix 
top level directory containing data 
name of animal 
electrode number 
ID # in cluster files 
brain region 

-- number of cells this cell monosynaptically excited 
-- number of cells this cell monosynaptically inhibited 
-- physiologically identified exciting cells based on CCG analysis 
-- physiologically identified inhibiting cells based on CCG analysis 
-- (Detailed method in Mizuseki Sirota Pastalkova and Buzsaki., 2009 Neuron paper.) 
excited integer, -- based on cross-correlogram analysis, the cell is monosynaptically 

excited by other cells 

inhibited integer, -- based on cross-correlogram analysis, the cell is monosynaptically 
inhibited by other cells 

fireRate real, -- meanISI=mean (bootstrp (100 , 'mean' , ISI) ) ; fireRate = SampleRate/MeanlSI ; ISI is 
interspike intervals. 

totalFireRate real, -- num of spikes divided by total recording length 

cellType string -- ' p ' ^pyramidal , ' i ' =interneuron, 'n'=not assigned as pyramidal or 

interneuron 
) ; 



create table session ( 

id integer, -- matches row in original MatLab Beh matrix 

topdir string, -- directory in data set containing data (tar.gz) files 

session string, -- individual session name (corresponds to name of tar.gz file having data) 
behavior string, -- one of: Mwheel, Open, Tmaze, Zigzag, bigSquare, bigSquarePlus , 

-- linear, linearOne, linearTwo, midSquare, plus, sleep, wheel, wheel_home 
familiarity integer, -- number of times animal has done task, l=animal did task for first time, 

-- 2=second time, 3=third time, 10=10 or more 
duration real -- recording length in seconds 



create table file ( 

-- information about files in hc3 dataset 

topdir string, -- directory in data set containing data (tar.gz) files 

session string, -- individual session name (corresponds to name of tar.gz file having data) 

size integer, -- number of bytes in tar.gz file 

video_type string, -- 'mpg', 'mlv' or '-' (for no video file) 

video_size integer -- size of video file, or 0 if no video file 
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create table epos ( 

-- has electrode positions for each top level directory 
-- Note, some regions do not match that in cell table. 
-- Those that differ have following meanings: 

DGCA3 : not sure if the electrode is DG or CA3 . 

Ctx: somewhere in the cortex (above the hippocampus) 

CA: somewhere in the hippocampus (do not know if it is CAl , CA3 or DG) 
topdir string, -- directory in data set containing data (tar.gz) file 
animal string, -- animal name 
el string, -- region for electrode 1 

e2 string, -- region for electrode 2 

e3 string, -- region for electrode 3 

e4 string, -- region for electrode 4 

-- ... (e5 through el4 fields not shown) 
el5 string, -- region for electrode 15 

el6 string -- region for electrode 16 



Data availability 

CRCNS: Multiple single unit recordings from different rat hip- 
pocampal and entorhinal regions while the animals were performing 
multiple behavioral tasks, http://dx,doi.org/10.6080/K09G5JRZ 

Terms of data usage: Data on this site is made available only for 
scientific purposes. Redistribution of the data is not permitted. Any 
publications derived from the data must cite the data contributors 
and CRCNS.org as being the source of the data and the original 
paper(s) that generated the data. Unnecessary downloading of large 
data files is not permitted. (To minimize demands on the server, 
only data expected to be useful for your scientific purposes should 
be downloaded). 

Privacy notice: Occasionally the researchers who contribute data 
wish to know who has downloaded their data. Upon request we 
will provide this information to the data contributors. So, if you 
download data, there is a possibility that your name and email 
address will be provided to the data contributor. We request that the 
data contributors only use the information for legitimate scientific 
purposes (such as determining the frequency of downloads, or 
contacting users to providing updated information about the data or 
to explore possible collaborations). 
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Current Referee Status: 
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Referee Responses for Version 1 

James Knierim 

Zanvyl Krieger Mind/Brain Institute and Department of Neuroscience, Johns Hopkins University, 
Baltimore, MD, USA 

Approved: 07 May 2014 

Referee Report: 07 May 2014 
doi:1 0.5256/f 1 000research.41 71 .r4603 

This data set will be a valuable resource for investigators who wish to test hypotheses about hippocampal 
function and interaction with entorhinal cortex at the level of single unit and LFP physiology. I have not 
investigated the data base carefully to ensure its utility, as I assume the authors have done so. My 
comments are limited to a few questions about their FIOOOResearch article describing the data base. 

1 . 'Animal surgery' - second paragraph: 

Do the authors mean "in the coronal plane?" It is not clear whether the tetrodes were angled 
medially or laterally in that plane, or whether the authors mean that the tetrodes were angled 
anteriorly or posteriorly to the coronal plane. Please clarify here and in other locations in the text 
how the tetrodes were angled. 

2. 'Animal surgery' - final paragraph: 

Does the data base contain histological figures? If not, are they easily identifiable and accessible 
from published reports? It would be very useful to ensure that the precise location of tetrodes could 
be made available to investigators. 

3. Table ^, 'elevated linear track': 
Shouldn't the fixed angles be 74 degrees (i.e. +/- 37 degrees)? 

4. 'Behavioural testing' item (5): 

Were the rats motivated to run the corridors in any particular order? Was there any working 
memory component to the trials? 

I have read this submission. I believe that I have an appropriate level of expertise to confirm that 
it is of an acceptable scientific standard. 

Competing Interests: No competing interests were disclosed. 
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Andrew P. Maurer 

Division of Neural Systems IVIennory and Aging, University of Arizona, Tucson, AZ, USA 

Approved: 06 May 2014 

Referee Report: 06 IVlay 2014 

doi:1 0.5256/f 1 000research.41 71 .r4604 

IVIizuseki and colleagues provide a description of 442 datasets (more than 200 hours) of hippocampal in 
vivo recordings. These datasets provide cell classification as well as the raw data, in case users wish to 
return to the high-sample traces and re-cluster the data on their own. The results of different analyses 
from the database have been published, although all analysis possibilities have not been exhausted. 
While this is an atypical review to write (as any suggestions on improvement to the database seem to be 
akin to looking a gift horse in the mouth), I am hoping the authors can discuss the ramifications of 
providing such an extensive database. While any database has limitations (e.g. sometimes determining 
the EEG units is a bit of an exercise with these data in the present format), it is perhaps more 
advantageous and constructive to discuss what the community can do to make the most use of it. As the 
authors state "Community-driven data siiaring may offer cross-validation of findings, refinement of 
interpretations and facilitate discoveries" and the challenges are now directed to those who are interested 
in participating in future analyses. In hopes that this will set-forth a new era of data-sharing, I hope that the 
authors can discuss their open database in a manner that parallels Giorgio Ascoli's discussion on sharing 
neural reconstruction files (Ascoli, 2006). It should be noted that I do not expect the authors to have 
comprehensive answers to each of my comments below, but it might be beneficial if they would provide 
some initial thoughts to seed further discussion. Some points that the authors may wish to discuss 
include: 

1 . One barrier to sharing data is "the fear of being scooped" (Ascoli, 2006). For example, scientific 
progress will be dramatically increased through parallel (and hopefully, collaborative) data 
analysis. Are the authors concerned about being "scooped"? What about a group of researchers 
unknowingly using the database to conduct an analysis that is also a student's PhD project? This 
hypothetical student may still be acquiring the expertise to keep pace with more seasoned 
researchers. 

2. "An often unspoken resistance to the sharing. . . data is born out of concern for criticisms and 
mistakes" (Ascoli, 2006). This is an unprecedented event that Mizuseki and colleagues have set 
forth, by providing a comprehensive catalog of data in an unabashed manner. The first point I want 
to touch upon is the level of raw exposure in releasing data. I have never met a neuroscientist who 
believes that anyone else could've conducted their analyses better than themselves (save a 
modest few), but to find those who believe that the authors "didn't look for the right thing" could 
make up a small battalion. Mizuseki and colleagues invite critics to their doorstep. Perhaps this is 
more similar to posing nude as a model for artist. By placing data online, it comes with judgment 
and the potential to be proved wrong. What do the authors believe the convention is if others follow 
in their footsteps? For example, if group A shares their database after publishing their results, 
group B downloads and analyzes their data in new light and finds the opposite results, what is the 
appropriate manner in handling the situation? The self-correction aspect of science is also 
accelerated when data is openly-shared. It remains to be seen how situations like this should/will 
be handled. 

3. There is also a chance for the data to be used in order to stifle or impede publication when the 
result is dubious. That is, should it be considered "fair-play" for a reviewer to use the same 
database and conduct a similar analysis with results that contradict a submitted manuscript's 
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results? A tactic such as this only seems appropriate in "open-review formats". 



4. "A final barrier to sharing digital reconstructions relates to the reluctance to lose or give a 
competitive edge" (Ascoli, 2006). The release of this immense database will surely be the 
stronghold of many new assistant professors who are still in the initial stages of setting up a 
physiology laboratory. Moreover, I can see these data being used in laboratories that are heavily 
analysis driven and limited in their own capacity for high-density in vivo recordings. This increase in 
the number of people analyzing data invites competitors for the authors as well as their neutral 
peers. Sheer logic dictates that the authors are not afraid of competition (otherwise, why share the 
data? Please note that I am a cynic and have been taught by many reviewers that "scientific 
altruism" is as abundant as snarks and unicorns). Why should researchers not be afraid of others 
using this database to compete? Is it believed that these data will be used for collaboration? What 
can be done to emphasize collaboration across laboratories when using the database? How is 
authorship handled when multiple groups use these data? If I spend my time analyzing these data, 
only to have it published under a "group project name" (similar to Sir William Timothy Gowers' 
Polymath project), do I put it on my CV? 

5. The paper explicitly states that large downloads are prohibited. Does this mean that I should not 
download all the data? Is it OK to use these data for a class project? If so, is it more appropriate for 
the professor to disseminate it to the students or should the students make their own CRCNS 
account? Finally, more of an afterthought but along similar lines: should there be a collaborative 
processing code library that should be developed and maintained in parallel with the use of these 
data (similar to GitHub or SourceForge)? 

I absolutely do not expect the authors to have complete answers to these questions nor should they carry 
the sole responsibility of determining the general conventions of what constitutes use versus misuse, but I 
do think that it is worth hearing their general thoughts. As this is the largest and most comprehensive 
database of in vivo hippocampal and entorhinal physiology to become available to the general scientific 
community, there will be an immense ramification. Scientific replication/external validation is an 
immediate and positive application of this database. As I have cited Timothy Gowers above, I think it 
would be best to leave off with his opinion of open data-sharing and collaboration: "It feels as though this 
process is to normal research as driving is to pushing a car." ( 
http://gowers.wordpress.com/2009/02/01/questions-of-procedure/). 

As a field we have the opportunity to compete or collaborate. I hope that these data facilitate 
cross-laboratory collaboration where two groups are reticent to share their own data. For those that are 
interested in embracing the collaborative spirit, the CRCNS website also has a "marketplace" section 
where ideas and potential collaborations can be discussed. Finally, I applaud the authors for this 
unprecedented act of scientific altruism. I hope this will be a platform that accelerates our understanding 
of the entorhinal-hippocampal circuitry through collaboration. 

I have read this submission. I believe that I have an appropriate level of expertise to confirm that 
it is of an acceptable scientific standard. 

Competing Interests: No competing interests were disclosed. 




Shuzo Sakata 

Strathclyde Institute of Pharmacy and Biomedical Sciences, University of Strathclyde, Glasgow, UK 



Page 1 2 of 1 4 



FlOOOResearch 



FIOOOResearch 2014,3:98 Last updated: 11 JUL 2014 



Approved: 06 May 2014 

Referee Report: 06 May 2014 

doi:1 0.5256/f 1 000research.41 71 .r4607 

This is an important report for tine neuroscience community regarding huge data sets recorded from the 
dorsal hippocampus and dorsomedial entorhinal cortex of behaving rats. Because the authors have 
already published a number of papers with the data sets, I have no doubt that this paper and their shared 
data will contribute to the further advancement of this field. 

This manuscript is clearly written in details. This article can be improved by clarifying the following minor 
points in the section Data collection and cell-type classification: 

1. Cluster quality 

It is not clear whether the quality of single units was assessed objectively, for example, by measuring 
isolation distance. 

2. Cell-type classification 

Because now researchers including the authors can classify cell-types optogenetically, it would be better 
to comment on potential pitfalls of the classification procedure used here. For example, it is difficult to 
exclude the possibility that some of "principal" neurons can be interneurons. 

I have read this submission. I believe that I have an appropriate level of expertise to confirm that 
it is of an acceptable scientific standard. 

Competing Interests: No competing interests were disclosed. 
Yoshikazu Isomura 

Brain Science Institute, Tamagawa University, Tokyo, Japan 

Approved: 06 May 2014 

Referee Report: 06 May 2014 

doi:1 0.5256/f 1 000research.41 71 .r4602 

In this work, Mizuseki etal. provide valuable information on their large-scale data sets of multi-neuronal 
and local field potential (LFP) recordings from the hippocampus (dentate gyrus, CAS, and CA1) as well as 
the enthorhinal cortex across the layers (layers 2, 3, 4, 5 etc.) of behaving rats. These data sets have 
originally been used for a number of their excellent studies for past ten years. Here, they fully summarized 
the experimental conditions and results in individual recording sessions very clearly for any users to plan 
to analyze the data. It will be a really useful treasure map for all hippocampus researchers. 

I have one minor comment. While they showed the number of isolated neurons in each brain area of each 
animal (Table 2 to Table 4), it is not clear yet how many neurons, at most, were recorded simultaneously. 
Some people will probably want to analyze spike activity in a population of many neurons. The data would 
be more available for the higher-order spike analysis if the authors added additional information on the 
best (or first choice) data set with a sufficient number of well-isolated neurons in the same brain area. 
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I have read this submission. I believe that I have an appropriate level of expertise to confirm that 
it is of an acceptable scientific standard. 



Competing Interests: No competing interests were disclosed. 
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